C ONSTRAINT - B ASED M INING AND L EARNING AT ECML / PKDD 2007 CMILE ’ 07 September 21 , 2007 Warsaw , Poland

نویسندگان

  • Luc De Raedt
  • Siegfried Nijssen
  • Marzena Kryszkiewicz
  • Hiroki Arimura
  • Hendrik Blockeel
  • Francesco Bonchi
  • Jean-François Boulicaut
  • Toon Calders
  • Ian Davidson
  • Saso Dzeroski
  • Peter Flach
  • Minos Garofalakis
  • Thomas Gärtner
  • Fosca Giannotti
  • Bart Goethals
  • Jiawei Han
  • Tomer Hertz
  • Kristian Kersting
  • Ross D. King
  • Joost N. Kok
  • Stefan Kramer
  • Tilman Lange
  • Taneli Mielikäinen
  • Jan Struyf
  • Rosa Meo
  • Shinichi Morishita
  • Céline Robardet
  • Arno Siebes
  • Kiri Wagstaff
  • Takashi Washio
  • Philip S. Yu
  • Xifeng Yan
  • Mohammed Zaki
  • Saher Esmeir
  • Shaul Markovitch
چکیده

Machine learning techniques are increasingly being used to produce a wide-range of classifiers for complex real-world applications that involve different constraints both on the resources allocated for the learning process and on the resources used by the induced model for future classification. As the complexity of these applications grows, the management of these resources becomes a challenging task. In this work we introduce ACT (Anytime Cost-sensitive Tree learner), a novel framework for operating in such environments. ACT is an anytime algorithm that allows trading computation time for lower classification costs. It builds a tree top-down and exploits additional time resources to obtain better estimations for the utility of the different candidate splits. Using sampling techniques ACT approximates for each candidate split the utility of the subtree under it and favors a split with the best evaluation. Due to its stochastic nature ACT is expected to be able to escape local minima, into which greedy methods may be trapped. ACT can be applied in two anytime setups: the contract setup where the allocation of resources is known in advance, and the interruptible setup where the algorithm might be queried for a solution at any point of time. Experiments with a variety of datasets were conducted to compare the performance of ACT to that of the state of the art decision tree learners. The results show that for most domains ACT produces significantly better trees. In the cost-insensitive setup, where test costs are ignored, ACT could produce smaller and more accurate trees. When test costs are involved, the ACT trees were significantly more efficient for classification. ACT is also shown to exhibit good anytime behavior with diminishing returns. S. Esmeir and S. Markovitch. Anytime Learning of Decision Trees. Journal of Machine Learning Research (JMLR), 8, 2007. S. Esmeir and S. Markovitch. Occam’s Razor Just Got Sharper. In Proceedings of The 20th International Joint Conference on Artificial Intelligence (IJCAI-2007), 2007. S. Esmeir and S. Markovitch. Anytime Induction of Decision Trees: An Iterative Improvement Approach. In Proceedings of The 21st National Conference on Artificial Intelligence (AAAI2006), 2006. S. Esmeir and S. Markovitch. When a Decision Tree Learner Has Plenty of Time. In Proceedings of The 21st National Conference on Artificial Intelligence (AAAI-2006), 2006. S. Esmeir and S. Markovitch. Lookahead-based Algorithms for Anytime Induction of Decision Trees. In Proceedings of the 21st International Conference on Machine Learning (ICML-2004), 2004. Constraint Based Hierarchical Clustering for Text Documents

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Light microscopic staining and histochemical applications for plant meristems embedded in Spurr's medium.

A simple and convenient procedure was u sed for studying plant m erist ematic tissues. Tissues were fixed in 4% glutaraldehyde, dehydrated in a t ertiary butyl alcohol series and embedded in SPURR's m edium. Sectioning may be done with a glass knife in an inexpensively made holder fitted t o a rotary microtome. Sta ining with 0.5 % aqueous safranin or fast green was followed by counter st a ini...

متن کامل

Corporate E-learning Strategy

Not th e leas t factor , when lau nching th e corp orate tr aining, is str ategy dete rmination. On the one hand, an e ducational institution has to follow the mar ket and be able to satisfy the demands of corporate clients and companies. I t is a s o called market adaptation. However, th is approach implies a number of shortcomings. An excessive interest in mar ket ad aptation may l ead to lop...

متن کامل

A Formal Grammar Of Expressiveness For Sacred Legends

A formal grammar accounting for a r t i s t i c a b i l i t i e s of a fo lk story t e l l e r is proposed. This grammar comprises the fol lowing components: I ) a grammar def ining the set of formulae representing steps of tex t generation; 2) a l i s t of "messages" underlying the texts of a f o l k lore genre; 3) a l i s t of " ru les of embell ishment" serving to attach a r t i s t i c qua ...

متن کامل

STUDIES ON NUCLEIC ACID METACHROMASY II. Metachromatic and Orthochromatic Staining by Toluidine Blue of Nucleic Acids in Tissue Sections

Acrolcin-fixed, polyester wax-cmbcddcd tissue sections showed excellent preservation of light microscopic architecture and, when stained with toluidinc blue, intense color contrast between DNA, which stained orthochromatically, and RNA, which stained metachromatically. This method has practical value for differentiating DNA from RNA in the same section. The color c~)ntrast was impaired by subst...

متن کامل

STUDIES ON NUCLEIC ACID METACHROMASY II. Metachromatic and Orthochromatic Staining by Toluidine Blue of Nucleic Acids in Tissue Sections

Acrolcin-fixed, polyester wax-cmbcddcd tissue sections showed excellent preservation of light microscopic architecture and, when stained with toluidinc blue, intense color contrast between DNA, which stained orthochromatically, and RNA, which stained metachromatically. This method has practical value for differentiating DNA from RNA in the same section. The color c~)ntrast was impaired by subst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007